Abstract

We prove that the spectral gap of the Swendsen-Wang dynamics for the 
random-cluster model on arbitrary graphs with m edges is bounded above by 
16m log m times the spectral gap of the single-bond (or heat-bath) dynamics. 
This and the corresponding lower bound (from [U12]) imply that rapid mixing
of these two dynamics is equivalent. 

Using the known lower bound on the spectral gap of the Swendsen-Wang 
dynamics for the two-dimensional square lattice $\mathbb{Z}_L^2$ of side length $L$ at high
temperatures and a result for the single-bond dynamics on dual graphs, we
obtain rapid mixing of both dynamics on $\mathbb{Z}_L^2$ at all non-critical temperatures.
In particular this implies, as far as we know, the first proof of rapid mixing
of a classical Markov chain for the Ising model on $\mathbb{Z}_L^2$ at all temperatures.

1 Introduction 

Markov chains for the random-cluster model and the closely related q-state Potts
model are the topic of many research articles from various areas of mathematics
and statistical physics. Probably the most studied model is the Ising model (or
2-state Potts model) on the two-dimensional square lattice. While there is almost
complete knowledge about the mixing properties of single-spin dynamics, such as
the heat-bath dynamics, there are only a few results on cluster algorithms, such
as Swendsen-Wang, or dynamics on the corresponding random-cluster model. The
single-spin dynamics on $\mathbb{Z}_L^2$, i.e. the two-dimensional square lattice of side length L,
is known to mix rapidly above the critical temperature [MO94], and below the critical
temperature the mixing time is exponential in the side length L, see [CGMS96]. See
also [Mar99] for an excellent survey of the (at that time) known results. Only
recently it was proven by Lubetzky and Sly [LS10] that the single-spin dynamics
is also rapidly mixing at the critical temperature. One approach to overcome the
torpid (or slow) mixing at low temperatures was to consider cluster algorithms that
change the spin of a large portion of vertices at once. The most successful approach
(so far) is the Swendsen-Wang dynamics (SW) that is based on the close relation
between Potts and random-cluster models, see [SW87] and [ES88]. But although it is
conjectured that this dynamics is rapidly mixing at high and at low temperatures,
again most of the results concern high temperatures. Known results for general
graphs include rapid mixing on trees and complete graphs at all temperatures, see
e.g. [CF98], [CDFR00] and [LNP07], and on graphs with small maximum degree at
high temperatures, see [CF98] and [Hub03]. Additionally, it is proven that, for
bounded degree graphs, rapid mixing of single-spin dynamics implies rapid mixing
of SW [U11]. At low temperatures there are only a few results on the mixing time
of SW. Besides the results for trees and complete graphs, we are only aware of two
articles concerning the low temperature case, [Mar92] and [Hub03]. While Huber
[Hub03] states rapid mixing for temperatures below some constant that depends on
the size of the graph, Martinelli [Mar92] gave a result for hypercubic subsets of $\mathbb{Z}^d$
at sufficiently low temperatures that do not depend on the side length. In addition
to the rapid mixing results for SW there are some results on torpid mixing. These
include torpid mixing for the q-state Potts model at the critical temperature on the
complete graph for all $q \ge 3$ [GJ97] and on hypercubic subsets of $\mathbb{Z}^d$ for q sufficiently
large [BCT10].

In this article we study the mixing properties of the Swendsen-Wang and the
heat-bath dynamics for the random-cluster model (in fact, SW can be seen as a
Markov chain for both random-cluster and Potts models), and we prove that the spectral
gap of SW is bounded above by some polynomial in the size of the graph times the
spectral gap of the heat-bath dynamics. In particular, this implies rapid mixing of
SW for the Potts model on the two-dimensional square lattice at all temperatures
below the critical one.

To state our results in detail, we first have to define the models and the
algorithms. Let G = (V, E) be a graph with finite vertex set V and edge set E. The
random-cluster model (also known as the FK-model) with parameters $p \in (0,1)$ and
$q \in \mathbb{N}$, see Fortuin and Kasteleyn [FK72], is defined on the graph G by its state
space $\Omega_{RC} = \{A : A \subseteq E\}$ and the RC measure

$$\mu(A) := \mu^G_{p,q}(A) = \frac{1}{Z(G,p,q)} \left(\frac{p}{1-p}\right)^{|A|} q^{c(A)},$$

where $c(A)$ is the number of connected components in the graph $(V,A)$, counting
isolated vertices as a component, and Z is the normalization constant that makes
$\mu$ a probability measure. Note that this model is well-defined also for non-integer
values of q, but we do not need this generalization here. See [Gri06] for further
details and related topics.
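As an illustration of the definition above, the following minimal sketch enumerates the RC measure by brute force on a very small graph. The function names (`num_components`, `rc_measure`) and the triangle example are assumptions made for this illustration only; the enumeration is exponential in the number of edges and is meant only to make the roles of $|A|$ and $c(A)$ concrete.

```python
from itertools import combinations


def num_components(n_vertices, edge_set):
    """Count connected components of (V, A); isolated vertices count too."""
    parent = list(range(n_vertices))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    count = n_vertices
    for u, v in edge_set:
        ru, rv = find(u), find(v)
        if ru != rv:
            parent[ru] = rv
            count -= 1
    return count


def rc_measure(n_vertices, edges, p, q):
    """Return {A: mu(A)} over all edge subsets A (brute-force enumeration)."""
    weights = {}
    for r in range(len(edges) + 1):
        for subset in combinations(edges, r):
            a = frozenset(subset)
            # Unnormalised weight (p / (1 - p))^{|A|} * q^{c(A)}.
            weights[a] = (p / (1 - p)) ** len(a) * q ** num_components(n_vertices, a)
    z = sum(weights.values())  # normalisation constant Z(G, p, q)
    return {a: w / z for a, w in weights.items()}


# Hypothetical example: the triangle on vertices 0, 1, 2.
triangle = [(0, 1), (1, 2), (0, 2)]
mu = rc_measure(3, triangle, p=0.4, q=2)
```

For q = 1 the cluster weight disappears and the measure reduces to independent bond percolation with density p, which gives a quick consistency check.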

A closely related model is the q-state Potts model on G at inverse temperature
$\beta > 0$, that is defined as the set of possible configurations $\Omega_P = [q]^V$, where
$[q] := \{1,\dots,q\}$ is the set of colors (or spins), together with the probability measure

$$\pi(\sigma) := \pi^G_{\beta,q}(\sigma) = \frac{1}{Z(G, 1-e^{-\beta}, q)} \exp\Big\{\beta \sum_{u,v:\,\{u,v\}\in E} \mathbb{1}\big(\sigma(u)=\sigma(v)\big)\Big\}$$

for $\sigma \in \Omega_P$, where $Z(\cdot,\cdot,\cdot)$ is the same normalization constant as for the RC model
(see [Gri06, Th. 1.10]). For q = 2 this model is called the Ising model.

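The connection of these models is given by a coupling of the RC and the Potts
measure in the case $p = 1 - e^{-\beta}$. Let $\sigma \in \Omega_P$ and $A \in \Omega_{RC}$. Then the joint measure
of $(\sigma, A) \in \Omega_J := \Omega_P \times \Omega_{RC}$ is defined by

$$\nu(\sigma,A) := \nu^G_{p,q}(\sigma,A) = \frac{1}{Z(G,p,q)} \left(\frac{p}{1-p}\right)^{|A|} \mathbb{1}\big(A \subseteq E(\sigma)\big),$$

where

$$E(\sigma) := \big\{\{u,v\}\in E : \sigma(u) = \sigma(v)\big\}.$$

The marginal distributions of $\nu$ are exactly $\pi$ and $\mu$, respectively, and we will call $\nu$
the FKES (Fortuin-Kasteleyn-Edwards-Sokal) measure, see e.g. [ES88] and [Gri06].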
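The statement about the marginals can be checked numerically on a small graph. The sketch below is only an illustration under assumptions: the helper names are hypothetical, the joint weight is taken to be $(p/(1-p))^{|A|}\,\mathbb{1}(A \subseteq E(\sigma))$ with $p = 1 - e^{-\beta}$, and everything is enumerated by brute force.

```python
import math
from itertools import combinations, product


def num_components(n, edge_set):
    """Count connected components of (V, A) with a small union-find."""
    parent = list(range(n))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    count = n
    for u, v in edge_set:
        ru, rv = find(u), find(v)
        if ru != rv:
            parent[ru] = rv
            count -= 1
    return count


def fkes_marginal_errors(n, edges, beta, q):
    """Compare the marginals of the joint measure with the RC and Potts measures."""
    p = 1.0 - math.exp(-beta)
    ratio = p / (1.0 - p)
    subsets = [frozenset(s) for r in range(len(edges) + 1)
               for s in combinations(edges, r)]
    colourings = list(product(range(q), repeat=n))

    def mono(sigma):
        # E(sigma): edges whose endpoints carry the same colour.
        return frozenset(e for e in edges if sigma[e[0]] == sigma[e[1]])

    joint = {(s, a): ratio ** len(a)
             for s in colourings for a in subsets if a <= mono(s)}
    z_joint = sum(joint.values())

    rc = {a: ratio ** len(a) * q ** num_components(n, a) for a in subsets}
    potts = {s: math.exp(beta * len(mono(s))) for s in colourings}
    z_rc, z_potts = sum(rc.values()), sum(potts.values())

    err_rc = max(abs(sum(w for (s, b), w in joint.items() if b == a) / z_joint
                     - rc[a] / z_rc) for a in subsets)
    err_potts = max(abs(sum(w for (t, b), w in joint.items() if t == s) / z_joint
                        - potts[s] / z_potts) for s in colourings)
    # Third value: relative difference of the two normalisation constants.
    return err_rc, err_potts, abs(z_rc - z_potts) / z_rc
```

The third returned value compares the two normalization constants, which should agree up to rounding when $p = 1 - e^{-\beta}$.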

The Swendsen-Wang dynamics (SW) uses this coupling implicitly in the following
way. Suppose the SW at time t is in the state $A_t \in \Omega_{RC}$. We choose $\sigma_t \in \Omega_P$ with
respect to the measure $\nu(\cdot, A_t)$, i.e. every connected component of $(V, A_t)$ is colored
independently and uniformly at random with a color from $[q]$. Then take $E(\sigma_t)$ and
delete each edge independently with probability $1 - p$ to obtain $A_{t+1} \in \Omega_{RC}$, which
can be seen as sampling from $\nu(\sigma_t, \cdot)$. Denote by $P_{SW}$ the transition matrix of this
Markov chain. Of course, we can make these two steps in reverse order to obtain a
Markov chain for the q-state Potts model with transition matrix $\widetilde{P}_{SW}$.
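The two steps just described can be sketched as a single update function. This is a minimal illustration, not an optimized implementation; the function name `sw_step`, the vertex labelling `0..n-1` and the use of Python's `random.Random` are assumptions of this sketch.

```python
import random


def sw_step(n, edges, current, p, q, rng):
    """One Swendsen-Wang update A_t -> A_{t+1} on the random-cluster state space."""
    # Step 1: colour every connected component of (V, A_t) uniformly from [q].
    parent = list(range(n))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for u, v in current:
        parent[find(u)] = find(v)

    colour_of_root = {}
    sigma = []
    for v in range(n):
        root = find(v)
        if root not in colour_of_root:
            colour_of_root[root] = rng.randrange(q)
        sigma.append(colour_of_root[root])

    # Step 2: keep each monochromatic edge independently with probability p,
    # i.e. delete it with probability 1 - p.
    return frozenset(e for e in edges
                     if sigma[e[0]] == sigma[e[1]] and rng.random() < p)


rng = random.Random(0)
triangle = [(0, 1), (1, 2), (0, 2)]
next_state = sw_step(3, triangle, frozenset({(0, 1)}), 0.5, 2, rng)
```

On a graph with a single edge and empty current state, the edge is added with probability $p/q$, since both endpoints must first receive the same color.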

The heat-bath dynamics (HB) for the random-cluster model is a local Markov
chain that, given the current state $A_t \in \Omega_{RC}$, sets $A_{t+1} = A_t$ with probability $\frac12$
and otherwise chooses an edge $e \in E$ uniformly at random and changes the state at
most at the edge e with respect to the conditional measure given all the other edges,
which is sampling of $A_{t+1}$ from the conditional measure $\mu(\cdot \mid \{A_t \cup e, A_t \setminus e\})$. The
transition matrix of this chain is denoted $P_{HB}$.

The spectral gap of a Markov chain with transition matrix P is defined by 

$$\lambda(P) := 1 - \max\big\{|\xi| : \xi \text{ is an eigenvalue of } P,\ \xi \neq 1\big\}.$$

We prove 

Theorem 1. Let $P_{SW}$ (resp. $P_{HB}$) be the transition matrix of the Swendsen-Wang
(resp. heat-bath) dynamics for the random-cluster model on a graph with m edges.
Then

$$\lambda(P_{SW}) \le 16\, m \log(m)\, \lambda(P_{HB}).$$
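The inequality of Theorem 1 can be sanity-checked numerically on a very small graph. The sketch below is an illustration only: it builds both transition matrices by brute force (Swendsen-Wang by summing over all colorings, heat-bath by resampling one edge from the conditional measure with laziness 1/2) and estimates the spectral gaps with a plain power iteration on the symmetrized matrix. All function names and the choice of power iteration are assumptions of this sketch, and the check says nothing about the sharpness of the bound.

```python
import math
from itertools import combinations, product


def component_labels(n, edge_set):
    """Label each vertex by a representative of its component in (V, A)."""
    label = list(range(n))

    def find(x):
        while label[x] != x:
            label[x] = label[label[x]]
            x = label[x]
        return x

    for u, v in edge_set:
        label[find(u)] = find(v)
    return [find(v) for v in range(n)]


def build_sw_and_hb(n, edges, p, q):
    m = len(edges)
    states = [frozenset(s) for r in range(m + 1) for s in combinations(edges, r)]
    ncomp = {a: len(set(component_labels(n, a))) for a in states}
    ratio = p / (1.0 - p)
    weight = {a: ratio ** len(a) * q ** ncomp[a] for a in states}
    z = sum(weight.values())
    mu = {a: weight[a] / z for a in states}

    # Swendsen-Wang: sum over colourings sigma with A and B inside E(sigma).
    sw = {a: {b: 0.0 for b in states} for a in states}
    for sigma in product(range(q), repeat=n):
        mono = frozenset(e for e in edges if sigma[e[0]] == sigma[e[1]])
        inside = [a for a in states if a <= mono]
        for a in inside:
            for b in inside:
                sw[a][b] += q ** (-ncomp[a]) * ratio ** len(b) * (1.0 - p) ** len(mono)

    # Lazy heat-bath: stay with probability 1/2, else resample one edge.
    hb = {a: {b: 0.0 for b in states} for a in states}
    for a in states:
        hb[a][a] += 0.5
        for e in edges:
            plus, minus = a | {e}, a - {e}
            total = mu[plus] + mu[minus]
            hb[a][plus] += mu[plus] / total / (2 * m)
            hb[a][minus] += mu[minus] / total / (2 * m)
    return states, mu, sw, hb


def spectral_gap(states, mat, mu, iterations=3000):
    """Estimate 1 - max|xi| over eigenvalues xi != 1 of a reversible chain."""
    root = {a: math.sqrt(mu[a]) for a in states}
    # Symmetrise and remove the eigenvalue 1 (eigenvector sqrt(mu)).
    sym = [[root[a] * mat[a][b] / root[b] - root[a] * root[b] for b in states]
           for a in states]
    vec = [1.0 + i for i in range(len(states))]
    norm = 0.0
    for _ in range(iterations):
        vec = [sum(row[j] * vec[j] for j in range(len(vec))) for row in sym]
        norm = math.sqrt(sum(x * x for x in vec))
        if norm < 1e-300:
            return 1.0
        vec = [x / norm for x in vec]
    return 1.0 - norm


edges = [(0, 1), (1, 2), (0, 2)]
states, mu, sw, hb = build_sw_and_hb(3, edges, 0.4, 2)
gap_sw = spectral_gap(states, sw, mu)
gap_hb = spectral_gap(states, hb, mu)
```

Power iteration is used here only to keep the sketch free of external dependencies; for larger state spaces a proper symmetric eigenvalue solver would be the natural choice.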

Using the corresponding lower bound, which was proven in [U12], we obtain that
SW is rapidly mixing if and only if HB is rapidly mixing, since the spectral gaps
can differ only by a polynomial in the number of edges of the graph. Furthermore
we prove that the heat-bath dynamics for the RC model on a planar graph G with
parameters p and q has the same spectral gap as the heat-bath dynamics for the
dual model, which is the random-cluster model on the dual graph $G^\dagger$ (see Section
5.1 for definitions) with parameters $p^*$ and q, where $p^*$ satisfies
$\frac{p^*}{1-p^*} = \frac{q(1-p)}{p}$. We
denote the dynamics for the dual model by $P^\dagger_{HB}$ (resp. $P^\dagger_{SW}$). This was probably
known before, but we could not find a reference. It follows

Corollary 2. Let $P_{SW}$ be the transition matrix of the Swendsen-Wang dynamics for
the random-cluster model on a planar graph G with m edges and let $P^\dagger_{SW}$ be the SW
dynamics for the dual model. Then there exists a constant $c \le 16 \min\{q, \frac{1}{1-p^*}\}$, such
that

$$\lambda(P_{SW}) \le c\, m \log(m)\, \lambda(P^\dagger_{SW}).$$

If we consider the two-dimensional square lattice of side length L, i.e. the graph
$\mathbb{Z}_L^2 = (V_L, E_L)$ with $V_L = [L]^2$ and $E_L = \big\{\{u,v\} \in \binom{V_L}{2} : |u_1 - v_1| + |u_2 - v_2| = 1\big\}$,
we can deduce the following from the results of [U11].
Theorem 3. Let $P_{SW}$ be the transition matrix of the Swendsen-Wang dynamics for
the random-cluster model on $\mathbb{Z}_L^2$ with parameters p and q. Let $N = L^2$. Then there
exist constants $c_p, c', C < \infty$ such that

• $\lambda(P_{SW})^{-1} \le c_p N$ for $p < p_c(q)$,

• $\lambda(P_{SW})^{-1} \le c_p N^2 \log N$ for $p > p_c(q)$,

• $\lambda(P_{SW})^{-1} \le c' N^C$ for $q = 2$ and $p = p_c(2)$,

where $p_c(q) = \frac{\sqrt{q}}{1+\sqrt{q}}$.

An immediate consequence is the following corollary. 

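Corollary 4. Let $\widetilde{P}_{SW}$ be the transition matrix of the Swendsen-Wang dynamics for
the q-state Potts model on $\mathbb{Z}_L^2$ at inverse temperature $\beta$. Let $N = L^2$. Then there
exist constants $c_\beta, c', C < \infty$ such that

• $\lambda(\widetilde{P}_{SW})^{-1} \le c_\beta N$ for $\beta < \beta_c(q)$,

• $\lambda(\widetilde{P}_{SW})^{-1} \le c_\beta N^2 \log N$ for $\beta > \beta_c(q)$,

• $\lambda(\widetilde{P}_{SW})^{-1} \le c' N^C$ for $q = 2$ and $\beta = \beta_c(2)$,

where $\beta_c(q) = \log(1 + \sqrt{q})$.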

This seems to be the first proof of rapid mixing of a classical Markov chain for
the Ising model at all temperatures. In fact, in [U11] a somewhat artificial Markov
chain, that makes an additional step on the dual graph, is proven to be rapidly mixing.

We also obtain for the heat-bath dynamics 

Theorem 5. Let $P_{HB}$ be the transition matrix of the heat-bath dynamics for the
random-cluster model on $\mathbb{Z}_L^2$ with parameters p and q. Let $N = L^2$. Then there
exist constants $c_p, c', C < \infty$ such that

• $\lambda(P_{HB})^{-1} \le c_p N^2 \log N$ for $p \neq p_c(q)$,

• $\lambda(P_{HB})^{-1} \le c' N^C$ for $q = 2$ and $p = p_c(2)$,

where $p_c(q) = \frac{\sqrt{q}}{1+\sqrt{q}}$.
The results of [U11], and hence the proofs of Theorems 3 and 5, rely ultimately
on the rapid mixing results for the heat-bath dynamics for the Potts (resp. Ising)
model that were proven over the last decades. To state only some of them, see e.g.
[LS10], [MO94], [MOS94] and [Ale98] together with the proof of exponential decay
of connectivities up to the critical temperature from [BD10]. These articles give
an almost complete picture of what is known so far about mixing of single-spin
dynamics in $\mathbb{Z}_L^2$.

The plan of this article is as follows. In Section 2 we introduce the necessary
notation related to the spectral gap of Markov chains. Section 3 contains a more
detailed description of the algorithms and the definition of the "building blocks"
that are necessary to represent the dynamics on the FKES model. In Section 4 we
will prove Theorem 1 and in Section 5 we introduce the notion of dual graphs and
prove the remaining results from above.

2 Spectral gap and mixing time 

As stated in the introduction, we want to estimate the efficiency of Markov chains. 
For an introduction to Markov chains and techniques to bound the convergence 
rate to the stationary distribution, see e.g. [LPW09]. In this article we consider
the spectral gap as a measure of the efficiency. Let P be the transition matrix of a
Markov chain with state space $\Omega$ that is ergodic, i.e. irreducible and aperiodic, and
has unique stationary measure $\pi$. Additionally let the Markov chain P be reversible
with respect to $\pi$, i.e.

$$\pi(x)\, P(x,y) = \pi(y)\, P(y,x) \quad \text{for all } x, y \in \Omega.$$

Then we know that the spectral gap of the Markov chain can be expressed in
terms of norms of the (Markov) operator P that maps from $L_2(\pi) := (\mathbb{R}^\Omega, \pi)$ to
$L_2(\pi)$, where inner product and norm are given by $\langle f, g\rangle_\pi = \sum_{x\in\Omega} f(x)\, g(x)\, \pi(x)$
and $\|f\|_\pi^2 := \sum_{x\in\Omega} f(x)^2\, \pi(x)$, respectively. The operator is defined by

$$Pf(x) := \sum_{y\in\Omega} P(x,y)\, f(y) \tag{1}$$

and represents the expected value of the function f after one step of the Markov
chain starting in $x \in \Omega$. The operator norm of P is

$$\|P\|_\pi := \|P\|_{L_2(\pi)\to L_2(\pi)} = \max_{\|f\|_\pi \le 1} \|Pf\|_\pi$$

and we use $\|\cdot\|_\pi$ interchangeably for functions and operators, because it will be clear
from the context which norm is used. It is well known that $\lambda(P) = 1 - \|P - S_\pi\|_\pi$
for reversible P, where $S_\pi(x,y) = \pi(y)$, and that reversibility of P is equivalent to
self-adjointness of the corresponding Markov operator, i.e. $P = P^*$, where $P^*$ is
the (adjoint) operator that satisfies $\langle f, Pg\rangle_\pi = \langle P^* f, g\rangle_\pi$ for all $f, g \in L_2(\pi)$. The
transition matrix that corresponds to the adjoint operator satisfies

$$P^*(x,y) = \frac{\pi(y)}{\pi(x)}\, P(y,x).$$

If we are considering a family of state spaces $\{\Omega_n\}_{n\in\mathbb{N}}$ with a corresponding
family of Markov chains $\{P_n\}_{n\in\mathbb{N}}$, we say that the chain is rapidly mixing for the
given family if $\lambda(P_n)^{-1} = O\big(\log(|\Omega_n|)^C\big)$ for all $n \in \mathbb{N}$ and some $C < \infty$.

In several (or probably most of the) articles on mixing properties of Markov 
chains the authors prefer to use the mixing time as measure of efficiency, which is 
defined by 

$$\tau(P) := \min\Big\{ t : \max_{x\in\Omega} \sum_{y\in\Omega} \big|P^t(x,y) - \pi(y)\big| \le \frac{1}{e} \Big\}.$$

The mixing time and spectral gap of a Markov chain (on finite state spaces) are
closely related by the following inequality, see e.g. [LPW09, Theorem 12.3 & 12.4].



Lemma 6. Let P be the transition matrix of a reversible, ergodic Markov chain with
state space $\Omega$ and stationary distribution $\pi$. Then

$$\lambda(P)^{-1} - 1 \le \tau(P) \le \log\Big(\frac{2e}{\pi_{\min}}\Big)\, \lambda(P)^{-1},$$

where $\pi_{\min} := \min_{x\in\Omega} \pi(x)$.

In particular, we obtain the following for the random-cluster model. 



Corollary 7. Let P be the transition matrix of a reversible, ergodic Markov chain
for the random-cluster model on G = (V, E) with parameters p and q. Then

$$\lambda(P)^{-1} - 1 \le \tau(P) \le \Big(2 + |E| \log\frac{1}{\min\{p, 1-p\}} + |V| \log q\Big)\, \lambda(P)^{-1}.$$

Therefore, all results of this article can also be written in terms of the mixing
time, losing the same factor as in Corollary 7.
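To make the translation between spectral gap and mixing time concrete, here is a small helper that evaluates both sides of such a bound. It is a sketch under the assumption that the upper constant reads $2 + |E| \log\frac{1}{\min\{p,1-p\}} + |V| \log q$ (which follows from $\pi_{\min} \ge \min\{p,1-p\}^{|E|} q^{-|V|}$); the function name and argument order are hypothetical.

```python
import math


def mixing_time_bounds(gap, n_vertices, n_edges, p, q):
    """Return (lower, upper) bounds on the mixing time for a given spectral gap.

    Assumes the random-cluster setting with parameters p in (0, 1) and q >= 1.
    """
    lower = 1.0 / gap - 1.0
    # Assumed constant: 2 + |E| log(1 / min{p, 1 - p}) + |V| log q.
    constant = (2.0
                + n_edges * math.log(1.0 / min(p, 1.0 - p))
                + n_vertices * math.log(q))
    upper = constant / gap
    return lower, upper


# Hypothetical numbers: a 4-cycle (4 vertices, 4 edges) with gap 0.5.
lo, hi = mixing_time_bounds(0.5, 4, 4, 0.3, 2)
```

The two bounds differ by a factor that is linear in the size of the graph, which is why polynomial bounds on the inverse spectral gap translate into polynomial bounds on the mixing time.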
3 Joint representation of the algorithms



In order to make our description of the considered Markov chains complete, we
state in this section formulas for their transition matrices. Additionally, we
introduce another local Markov chain that will be necessary for the further analysis and
introduce a representation of the dynamics on the (joint) FKES model.

The Swendsen-Wang dynamics (on the RC model), as stated in the introduction,
is based on the given connection of the random-cluster and Potts models and has
the transition matrix

$$P_{SW}(A,B) = q^{-c(A)} \left(\frac{p}{1-p}\right)^{|B|} \sum_{\sigma\in\Omega_P} (1-p)^{|E(\sigma)|}\, \mathbb{1}\big(A \cup B \subseteq E(\sigma)\big). \tag{2}$$

Recall that we denote by $\widetilde{P}_{SW}$ the Swendsen-Wang dynamics for the Potts model
and note that both dynamics have the same spectral gap, see [U11].

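The second algorithm we want to analyze is the (lazy) heat-bath dynamics. Let
$A \in \Omega_{RC}$ be given and denote by $\overset{A}{\longleftrightarrow}$ (resp. $\overset{A}{\nleftrightarrow}$) connected (resp. not connected) in
the subgraph $(V,A)$. Additionally we use throughout this article $A \cup e$ instead of
$A \cup \{e\}$ (respectively for $\cap$, $\setminus$) and denote the endpoints of e by $e^{(1)}$ and $e^{(2)}$, i.e.
$e = \{e^{(1)}, e^{(2)}\}$. For $A, B \in \Omega_{RC}$, $A \neq B$, the transition probabilities of the HB
dynamics are given by

$$P_{HB}(A,B) := \frac{1}{2|E|} \sum_{e\in E} \frac{\mu(B)}{\mu(A\cup e) + \mu(A\setminus e)}\, \mathbb{1}(A \ominus B \subseteq e), \tag{3}$$

where $\ominus$ denotes the symmetric difference and $P_{HB}(A,A)$ is chosen such that $P_{HB}$
is stochastic. Hence, $P_{HB}$ satisfies

$$P_{HB}(A,B) = \frac{1}{2|E|} \sum_{e\in E}
\begin{cases}
p, & \text{for } B = A \cup e \text{ and } e^{(1)} \overset{A\setminus e}{\longleftrightarrow} e^{(2)},\\
1-p, & \text{for } B = A \setminus e \text{ and } e^{(1)} \overset{A\setminus e}{\longleftrightarrow} e^{(2)},\\
\frac{p}{p+q(1-p)}, & \text{for } B = A \cup e \text{ and } e^{(1)} \overset{A\setminus e}{\nleftrightarrow} e^{(2)},\\
\frac{q(1-p)}{p+q(1-p)}, & \text{for } B = A \setminus e \text{ and } e^{(1)} \overset{A\setminus e}{\nleftrightarrow} e^{(2)}
\end{cases}$$

for $A \neq B$. The heat-bath dynamics has the advantage that the corresponding HB
dynamics for the dual model has the same spectral gap, see Section 5.1. Unfortunately,
these Markov chains do not admit a representation on the joint model like the
SW dynamics. Therefore we introduce the following (non-lazy) local dynamics with
transition probabilities

$$P_{SB}(A,B) = \frac{1}{|E|} \sum_{e\in E}
\begin{cases}
p, & \text{for } B = A \cup e \text{ and } e^{(1)} \overset{A}{\longleftrightarrow} e^{(2)},\\
1-p, & \text{for } B = A \setminus e \text{ and } e^{(1)} \overset{A}{\longleftrightarrow} e^{(2)},\\
\frac{p}{q}, & \text{for } B = A \cup e \text{ and } e^{(1)} \overset{A}{\nleftrightarrow} e^{(2)},\\
1-\frac{p}{q}, & \text{for } B = A \setminus e \text{ and } e^{(1)} \overset{A}{\nleftrightarrow} e^{(2)}.
\end{cases} \tag{4}$$

We call this Markov chain the single-bond dynamics (SB). This chain is inspired
by the Swendsen-Wang dynamics since $P_{SW} = P_{SB}$ for a graph that consists of two
vertices connected by a single edge. Note that $P_{SW}$, $P_{HB}$ and $P_{SB}$ are reversible with
respect to $\mu$.

Before we state the representation of $P_{SW}$ and $P_{SB}$ on the FKES model, we show
that the spectral gaps of $P_{SB}$ and $P_{HB}$ are closely related. For this let $I(A,B) :=
\mathbb{1}(A = B)$ and note that $\frac12(I + P_{SB})$ is the transition matrix of the lazy single-bond
dynamics.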

Lemma 8. For $P_{HB}$ and $P_{SB}$ for the random-cluster model with parameters p and
q we have

$$\frac12\, \lambda(P_{SB}) \le \lambda(P_{HB}) \le \big(1 - p(1 - q^{-1})\big)^{-1}\, \lambda\Big(\frac{I + P_{SB}}{2}\Big).$$

Proof. Using standard comparison ideas, e.g. from [DSC93, Section 2.A], we obtain
that for two transition matrices P and Q, $P(A,B) \le c\, Q(A,B)$ for all $A, B \in \Omega_{RC}$
implies $\lambda(P) \le c\, \lambda(Q)$, where for lazy Markov chains the inequality for all $A \neq B$
is sufficient. Additionally we have in general $\lambda(P) \le 2\lambda\big(\frac12(I + P)\big)$. Therefore it is
enough to prove $\frac12(I + P_{SB})(A,B) \le P_{HB}(A,B) \le \big(1 - p(1-q^{-1})\big)^{-1}\, \frac12(I + P_{SB})(A,B)$
for all $A \neq B$, which is easy to check.

□ 
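The entrywise inequalities used in the proof can be verified by brute force on a small graph. The following sketch builds the single-bond and lazy heat-bath matrices and checks the two inequalities for all pairs $A \neq B$. It rests on assumptions that should be read as part of the sketch: connectivity is tested in $(V, A)$ for the single-bond chain and in $(V, A \setminus e)$ for the heat-bath chain, and all function names are hypothetical.

```python
from itertools import combinations


def connected(n, edge_set, u, v):
    """True if u and v lie in the same component of (V, edge_set)."""
    parent = list(range(n))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in edge_set:
        parent[find(a)] = find(b)
    return find(u) == find(v)


def build_chains(n, edges, p, q):
    """Return (states, P_SB, P_HB) as nested dicts indexed by frozensets."""
    m = len(edges)
    states = [frozenset(s) for r in range(m + 1) for s in combinations(edges, r)]
    sb = {a: {b: 0.0 for b in states} for a in states}
    hb = {a: {b: 0.0 for b in states} for a in states}
    for a in states:
        hb[a][a] += 0.5  # laziness of the heat-bath chain
        for e in edges:
            plus, minus = a | {e}, a - {e}
            # Single-bond: connectivity of the endpoints in (V, A).
            add = p if connected(n, a, *e) else p / q
            sb[a][plus] += add / m
            sb[a][minus] += (1.0 - add) / m
            # Heat-bath: connectivity of the endpoints in (V, A \ e).
            add = p if connected(n, minus, *e) else p / (p + q * (1.0 - p))
            hb[a][plus] += add / (2 * m)
            hb[a][minus] += (1.0 - add) / (2 * m)
    return states, sb, hb


def lemma8_entrywise(states, sb, hb, p, q, tol=1e-12):
    """Check (I+P_SB)/2 <= P_HB <= c (I+P_SB)/2 off the diagonal."""
    c = 1.0 / (1.0 - p * (1.0 - 1.0 / q))
    for a in states:
        for b in states:
            if a == b:
                continue
            lazy_sb = 0.5 * sb[a][b]
            if lazy_sb > hb[a][b] + tol or hb[a][b] > c * lazy_sb + tol:
                return False
    return True


states, sb, hb = build_chains(3, [(0, 1), (1, 2), (0, 2)], 0.3, 3)
ok = lemma8_entrywise(states, sb, hb, 0.3, 3)
```

In this construction the upper inequality holds with equality whenever the endpoints of the updated edge are not connected in $(V, A \setminus e)$, which suggests the constant $(1 - p(1 - q^{-1}))^{-1}$ cannot be improved by this entrywise argument.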

We want to represent the Swendsen-Wang and the single-bond dynamics on the
FKES model, which consists of the product state space $\Omega_J := \Omega_P \times \Omega_{RC}$ and the
FKES measure $\nu$. This was done first in [U12] and we follow the steps from this
article. First we introduce the stochastic matrix that defines the mapping (by matrix
multiplication) from the RC to the FKES model

$$M(B, (\sigma, A)) := q^{-c(B)}\, \mathbb{1}(A = B)\, \mathbb{1}\big(B \subseteq E(\sigma)\big). \tag{5}$$
Note that M defines an operator (like in (1)) that maps from $L_2(\nu)$ to $L_2(\mu)$ and
its adjoint operator $M^*$ can be given by the (stochastic) matrix

$$M^*((\sigma, A), B) = \mathbb{1}(A = B).$$

The following matrix represents the updates of the RC "coordinate" in the FKES
model. For $(\sigma, A), (\tau, B) \in \Omega_J$ and $e = \{e^{(1)}, e^{(2)}\} \in E$ let

$$T_e\big((\sigma,A),(\tau,B)\big) := \mathbb{1}(\sigma = \tau)
\begin{cases}
p, & B = A \cup e \text{ and } \sigma(e^{(1)}) = \sigma(e^{(2)}),\\
1-p, & B = A \setminus e \text{ and } \sigma(e^{(1)}) = \sigma(e^{(2)}),\\
1, & B = A \setminus e \text{ and } \sigma(e^{(1)}) \neq \sigma(e^{(2)}).
\end{cases} \tag{6}$$

The following simple lemma shows some interesting properties of the matrices
from (5) and (6), e.g. $\{T_e\}_{e\in E}$ is a family of commuting projections in $L_2(\nu)$. This
will be important in the proof of the main result.

Lemma 9. Let $M$, $M^*$ and $T_e$ be the matrices from above. Then

(i) $M^* M$ and $T_e$ are self-adjoint in $L_2(\nu)$.

(ii) $T_e T_e = T_e$ and $T_e T_{e'} = T_{e'} T_e$ for all $e, e' \in E$.

(iii) $\|T_e\|_\nu = 1$ and $\|M^* M\|_\nu = 1$.

Now we can state the desired Markov chains with the matrices from above. 

Lemma 10. Let $M$, $M^*$ and $T_e$ be the matrices from above. Then

(i) $P_{SW} = M \Big(\prod_{e\in E} T_e\Big) M^*$.

(ii) $P_{SB} = \frac{1}{|E|} \sum_{e\in E} M\, T_e\, M^* = M \Big(\frac{1}{|E|} \sum_{e\in E} T_e\Big) M^*$.

From Lemma 9(ii) we have that the order of multiplication in (i) is unimportant.
Lemma 9 and 10 were proven in [U12, Lemma 3 & 4], but note that $P_{SB}$ in that article
is the lazy version of $P_{SB}$ from here.
4 Proof of Theorem 1



In this section we will prove Theorem 1. This is done in two subsections. In the first
one we prove some general norm estimates for operators on (resp. between) Hilbert 
spaces. In the second subsection we will apply these estimates to the setting from 
above to obtain the result. 

4.1 Technical lemmas 

In this section we provide some technical lemmas that will be necessary for the 
analysis. We state them in a general form, because we guess that they could be 
useful also in other settings. First let us introduce the notation. Throughout this 
section consider two Hilbert spaces $H_1$ and $H_2$ with the corresponding inner products
$\langle\cdot,\cdot\rangle_{H_1}$ and $\langle\cdot,\cdot\rangle_{H_2}$. The norms in $H_1$ and $H_2$ are defined as usual as the square root
of the inner product of a function with itself. Additionally, we denote by $\|\cdot\|_{H_1}$
(resp. $\|\cdot\|_{H_2\to H_1}$) the operator norms of operators mapping from $H_1$ to $H_1$ (resp.
$H_2$ to $H_1$). We consider two bounded, linear operators, R and T. The operator
$R : H_2 \to H_1$ maps from $H_2$ to $H_1$ and has the adjoint $R^*$, i.e. $R^* : H_1 \to H_2$ with
$\langle R^* f, g\rangle_{H_2} = \langle f, R g\rangle_{H_1}$ for all $f \in H_1$ and $g \in H_2$. The operator $T : H_2 \to H_2$ is
self-adjoint and acts on $H_2$. Obviously, $R T R^*$ is then self-adjoint on $H_1$.



Lemma 11. In the setting from above let T be also positive, i.e. $\langle T g, g\rangle_{H_2} \ge 0$,
then

$$\|R\, T^{k+1} R^*\|_{H_1} \le \|T\|_{H_2}\, \|R\, T^{k} R^*\|_{H_1}$$

for all $k \in \mathbb{N}$.

In the special case k = 1 this lemma was used in [U12] to prove a lower bound
on the spectral gap of SW. We will recall this result later.

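Proof. By the assumptions, T has a unique positive square root $\widetilde{T}$, i.e. $T = \widetilde{T}\widetilde{T}$,
which is again self-adjoint, see e.g. [Kre78, Th. 9.4-2]. We obtain

$$\begin{aligned}
\|R\, T^{k+1} R^*\|_{H_1} &= \|R\, \widetilde{T}^{2k+2} R^*\|_{H_1} = \|R\, \widetilde{T}^{k+1}\|^2_{H_2\to H_1} \le \|R\, \widetilde{T}^{k}\|^2_{H_2\to H_1}\, \|\widetilde{T}\|^2_{H_2} \\
&= \|R\, \widetilde{T}^{2k} R^*\|_{H_1}\, \|\widetilde{T}^2\|_{H_2} = \|T\|_{H_2}\, \|R\, T^{k} R^*\|_{H_1}.
\end{aligned}$$

□

In particular, if $\|T\|_{H_2} \le 1$ this proves monotonicity in k.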



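Lemma 12. In the setting from above let additionally $\|R\|^2_{H_2\to H_1} = \|R R^*\|_{H_1} \le 1$,
then

$$\|R\, T R^*\|^{2^k}_{H_1} \le \|R\, T^{2^k} R^*\|_{H_1}$$

for all $k \in \mathbb{N}$.

Proof. The case k = 0 is obvious. Now suppose the statement is correct for k − 1,
then

$$\begin{aligned}
\|R\, T R^*\|^{2^k}_{H_1} &= \Big(\|R\, T R^*\|^{2^{k-1}}_{H_1}\Big)^2 \le \|R\, T^{2^{k-1}} R^*\|^2_{H_1} \\
&\le \|R\, T^{2^{k-1}}\|^2_{H_2\to H_1}\, \|R^*\|^2_{H_1\to H_2} \\
&= \|R\, T^{2^k} R^*\|_{H_1}\, \|R R^*\|_{H_1} \le \|R\, T^{2^k} R^*\|_{H_1},
\end{aligned}$$

which proves the statement for k.

□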



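The next corollary combines the statements of the last two lemmas to give a
result similar to Lemma 12 for arbitrary exponents.

Corollary 13. Additionally to the general assumptions of this section let T be
positive, $\|T\|_{H_2} \le 1$ and $\|R R^*\|_{H_1} \le 1$. Then

$$\|R\, T R^*\|^{2k}_{H_1} \le \|R\, T^{k} R^*\|_{H_1}$$

for all $k \in \mathbb{N}$.

Proof. Let $l = \lfloor \log_2 k \rfloor$ such that $\frac{k}{2} < 2^l \le k$. Since $\|R\, T R^*\|_{H_1} \le 1$ by assumption,
we obtain

$$\|R\, T R^*\|^{2k}_{H_1} \le \|R\, T R^*\|^{2^{l+1}}_{H_1} \le \|R\, T^{2^{l+1}} R^*\|_{H_1} \le \|R\, T^{k} R^*\|_{H_1},$$

where the second inequality is Lemma 12 and the third follows from the monotonicity
in Lemma 11, because $2^{l+1} > k$.

□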






4.2 Proof 

In this section we apply the estimates from the last one. Recall that we consider 
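the dynamics on a graph G = (V, E) with m edges, i.e. m = |E|. Fix an arbitrary
ordering $e_1, \dots, e_m$ of the edges $e \in E$. We set the Hilbert spaces from the last
section to $H_1 = L_2(\mu)$ and $H_2 = L_2(\nu)$ and define the operators

$$T := \frac{1}{m} \sum_{i=1}^{m} T_{e_i}$$

and

$$\mathcal{T} := \prod_{i=1}^{m} T_{e_i}$$

with $T_e$ from (6). Note that $P_{SB} = M\, T\, M^*$ and $P_{SW} = M\, \mathcal{T} M^*$ by Lemma 10.
Additionally we define

$$\mathcal{T}_\alpha := \prod_{i=1}^{m} T_{e_i}^{\alpha_i}$$

for $\alpha \in \mathbb{N}^m$. By Lemma 9(ii) we obtain for $\alpha, \gamma \in \mathbb{N}^m$ that $\mathcal{T}_\alpha = \mathcal{T}_\gamma$ if and only if
$\{i : \alpha_i = 0\} = \{i : \gamma_i = 0\}$. Furthermore, $\mathcal{T}_\alpha = \mathcal{T}$ for every $\alpha \in \mathbb{N}^m$ with $\alpha_i > 0$ for
all $i = 1, \dots, m$. We prove the following theorem.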



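Theorem 14. Let $k = \lceil m \log\frac{m}{\varepsilon} \rceil$ and $R : H_2 \to H_1$ be a bounded, linear operator
with $\|R R^*\|_{H_1} \le 1$. Then

$$\|R\, T^{k} R^*\|_{H_1} \le (1 - \varepsilon)\, \|R\, \mathcal{T} R^*\|_{H_1} + \varepsilon.$$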

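Proof. Define the index sets $I_{m,k} := \{\alpha \in \mathbb{N}^m : \sum_{i=1}^m \alpha_i = k\}$ and $I^1_{m,k} := \{\alpha \in I_{m,k} :
\alpha_i > 0,\ \forall i = 1, \dots, m\}$. Let $I^0_{m,k} := I_{m,k} \setminus I^1_{m,k}$ and denote by $\binom{k}{\alpha}$, for $\alpha \in I_{m,k}$, the
multinomial coefficient. Obviously (by the multinomial theorem),

$$\sum_{\alpha\in I_{m,k}} \binom{k}{\alpha} = m^k$$

and

$$Z_{m,k} := \sum_{\alpha\in I^0_{m,k}} \binom{k}{\alpha} \le m\,(m-1)^k.$$

We write

$$T^k = \Big(\frac{1}{m}\sum_{i=1}^m T_{e_i}\Big)^k = \frac{1}{m^k} \sum_{\alpha\in I^1_{m,k}} \binom{k}{\alpha}\, \mathcal{T}_\alpha + \frac{1}{m^k} \sum_{\alpha\in I^0_{m,k}} \binom{k}{\alpha}\, \mathcal{T}_\alpha.$$

Note that we use for the second equality that the $T_e$'s are commuting by Lemma 9(ii).
Since we know that $\mathcal{T}_\alpha = \mathcal{T}$ for every $\alpha \in I^1_{m,k}$ (note that $I^1_{m,k} = \emptyset$ for k < m) and
$\|R\, \mathcal{T}_\alpha R^*\|_{H_1} \le 1$ for every $\alpha \in I_{m,k}$, we obtain

$$\begin{aligned}
\|R\, T^k R^*\|_{H_1} &\le \frac{1}{m^k} \sum_{\alpha\in I^1_{m,k}} \binom{k}{\alpha}\, \|R\, \mathcal{T} R^*\|_{H_1} + \frac{1}{m^k} \sum_{\alpha\in I^0_{m,k}} \binom{k}{\alpha}\, \|R\, \mathcal{T}_\alpha R^*\|_{H_1} \\
&\le \Big(1 - \frac{Z_{m,k}}{m^k}\Big)\, \|R\, \mathcal{T} R^*\|_{H_1} + \frac{Z_{m,k}}{m^k}.
\end{aligned}$$

Using $(1 - a)c + a \le (1 - b)c + b$ for $c \le 1$ and $a \le b$, and $\|R\, \mathcal{T} R^*\|_{H_1} \le 1$ it follows

$$\begin{aligned}
\|R\, T^k R^*\|_{H_1} &\le \Big(1 - m\Big(1 - \frac{1}{m}\Big)^k\Big)\, \|R\, \mathcal{T} R^*\|_{H_1} + m\Big(1 - \frac{1}{m}\Big)^k \\
&\le \big(1 - m\, e^{-k/m}\big)\, \|R\, \mathcal{T} R^*\|_{H_1} + m\, e^{-k/m}.
\end{aligned}$$

Setting $k = \lceil m \log\frac{m}{\varepsilon} \rceil$ yields the result. □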
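The counting step of this proof has a simple combinatorial reading: the normalized sum of multinomial coefficients over index vectors with at least one zero entry is the fraction of sequences in $\{1,\dots,m\}^k$ that miss at least one index. The sketch below computes this fraction exactly by inclusion-exclusion and compares it with $m(1-1/m)^k$ and with $\varepsilon$ for $k = \lceil m \log(m/\varepsilon) \rceil$. The function names are assumptions of this illustration.

```python
import math
from fractions import Fraction


def missing_index_fraction(m, k):
    """Exact fraction of sequences in {1..m}^k that miss at least one index."""
    # Inclusion-exclusion count of sequences in which every index occurs.
    onto = sum((-1) ** j * math.comb(m, j) * (m - j) ** k for j in range(m + 1))
    return 1 - Fraction(onto, m ** k)


def theorem14_terms(m, eps):
    """Return (k, exact fraction, union bound m (1 - 1/m)^k)."""
    k = math.ceil(m * math.log(m / eps))
    return k, missing_index_fraction(m, k), m * (1 - 1 / m) ** k


k, frac, bound = theorem14_terms(5, 0.5)
```

The exact fraction is typically noticeably smaller than the union bound, so the choice of k in the theorem is convenient rather than optimal.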

Now we are able to prove the comparison result for SW and SB dynamics. For
this let $S_1(B, (\sigma, A)) := \nu(\sigma, A)$ for all $B \in \Omega_{RC}$ and $(\sigma, A) \in \Omega_J$, which defines
an operator (by (1)) that maps from $H_2$ to $H_1$. The adjoint operator is then
given by $S_1^*((\sigma, A), B) := \mu(B)$ and thus, $S_1 S_1^*(A,B) = S_\mu(A,B) = \mu(B)$ for all
$A, B \in \Omega_{RC}$. For the proof we set

$$R := M - S_1.$$

It follows that $R R^* = (M - S_1)(M^* - S_1^*) = M M^* - S_\mu$, since $M S_1^* = S_1 M^* = S_\mu$,
but $M M^*(A,B) = \mathbb{1}(A = B)$ and thus $(R R^*)^2 = (I - S_\mu)^2 = I - S_\mu = R R^*$. This
implies $\|R R^*\|_{H_1} = 1$. Additionally, $P_{SW} - S_\mu = R\, \mathcal{T} R^*$ and $P_{SB} - S_\mu = R\, T R^*$.

Theorem 15. Let $P_{SW}$ (resp. $P_{SB}$) be the transition matrix of the Swendsen-Wang
(resp. single-bond) dynamics for the random-cluster model on a graph with m edges.
Then

$$\lambda(P_{SW}) \le 8\, m \log m\ \lambda(P_{SB}).$$
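Proof. Let $k = \lceil m \log\frac{m}{\varepsilon} \rceil$. Then

$$\begin{aligned}
\lambda(P_{SW}) &= 1 - \|R\, \mathcal{T} R^*\|_{H_1} \overset{\text{Thm. 14}}{\le} 1 - \frac{1}{1-\varepsilon}\Big(\|R\, T^k R^*\|_{H_1} - \varepsilon\Big) = \frac{1}{1-\varepsilon}\Big(1 - \|R\, T^k R^*\|_{H_1}\Big) \\
&\overset{\text{Cor. 13}}{\le} \frac{1}{1-\varepsilon}\Big(1 - \|R\, T R^*\|^{2k}_{H_1}\Big) \le \frac{2k}{1-\varepsilon}\Big(1 - \|R\, T R^*\|_{H_1}\Big) = \frac{2k}{1-\varepsilon}\, \lambda(P_{SB}),
\end{aligned}$$

where the last inequality comes from $1 - x^{2k} \le 2k(1 - x)$ for $x \in [0, 1]$. Setting $\varepsilon = \frac12$,
we obtain $\frac{2k}{1-\varepsilon} = 4k \le 8 m \log m$. This proves the statement.

□

Combining Lemma 8 and Theorem 15 proves Theorem 1.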



5 Proof of Theorems 3 and 5



In this section we introduce the notion of (planar) dual graphs and prove that 
the heat-bath dynamics on a planar graph G has the same spectral gap as the
heat-bath dynamics for the dual model on $G^\dagger$, which is the dual graph of G. This
immediately implies Corollary 2 and hence, that rapid mixing of the Swendsen-Wang
dynamics for the random-cluster model and its dual model is equivalent. Finally,
we use the known lower bounds on the spectral gap of SW on the two-dimensional
square lattice at high temperatures to prove Theorem 3 and Theorem 5.



5.1 Dual graphs 

Let G be a planar graph, i.e. a graph that can be embedded into a sphere $S^2$
such that two edges of G intersect only at a common endvertex. We fix such an
embedding for G. Then we define the dual graph $G^\dagger = (V^\dagger, E^\dagger)$ of G as follows.
Place a dual vertex in each face, i.e. in each region of $S^2$ whose boundary consists
of edges in the embedding of G, and connect 2 vertices by the dual edge $e^\dagger$ if and
only if the corresponding faces of G share the boundary edge e (see e.g. [Gri10,
Section 8.5]). Note that the dual graph certainly depends on the used embedding.
It is clear that the number of vertices can differ in the dual graph, but we have the
same number of edges.

Additionally we define a dual RC configuration $A^\dagger \subseteq E^\dagger$ in $G^\dagger$ to a RC state
$A \subseteq E$ in G by

$$e \in A \iff e^\dagger \notin A^\dagger,$$

where $e^\dagger$ is the edge in $E^\dagger$ that intersects e in our (fixed) embedding. (By
construction, this edge is unique.)

It is easy to obtain (see [Gri06, p. 134]) that the random-cluster models on the
(finite) graphs G and $G^\dagger$ are related by the equality

$$\mu^G_{p,q}(A) = \mu^{G^\dagger}_{p^*,q}(A^\dagger), \tag{7}$$

where the dual parameter $p^*$ satisfies

$$\frac{p^*}{1-p^*} = \frac{q\,(1-p)}{p}. \tag{8}$$

The self-dual point of this relation is given by $p_{sd}(q) = \frac{\sqrt{q}}{1+\sqrt{q}}$, which corresponds by
$p = 1 - e^{-\beta}$ to the critical temperature of the q-state Potts model $\beta_c(q) = \log(1+\sqrt{q})$
on $\mathbb{Z}^2$ [BD10]. For given G, q and p, we call the RC model on $G^\dagger$ with parameters
q and $p^*$ the dual model and we write $\mu^\dagger$ instead of $\mu^{G^\dagger}_{p^*,q}$, if the setting is fixed.
Using this, we obtain

$$P_{HB}(A,B) = \frac{1}{2|E|} \sum_{e\in E} \frac{\mu^\dagger(B^\dagger)}{\mu^\dagger\big((A\cup e)^\dagger\big) + \mu^\dagger\big((A\setminus e)^\dagger\big)}\, \mathbb{1}\big(A^\dagger \ominus B^\dagger \subseteq e^\dagger\big) = P^\dagger_{HB}(A^\dagger, B^\dagger),$$

where we write $P^\dagger_{HB}$ for the heat-bath dynamics for the dual model. Since both
transition matrices have obviously the same eigenvalues, by the above equality, this
proves the following lemma.

Lemma 16. Let $P_{HB}$ (resp. $P^\dagger_{HB}$) be the transition matrix of the heat-bath dynamics
for the random-cluster (resp. dual) model. Then

$$\lambda(P_{HB}) = \lambda(P^\dagger_{HB}).$$
Remark 17. Indeed, it is possible to prove a result similar to Lemma 16 also for
non-planar graphs as long as they can be embedded (without intersecting edges) in
surfaces of bounded genus. In this case, the equality becomes an inequality with a
constant that depends exponentially on the genus. In particular, this leads to the
same result as in Theorems 3 and 5 for $\mathbb{Z}_L^2$ with periodic boundary condition, i.e.
the square lattice on the torus. But since this would extend this article by several
pages, we decide to refer the interested reader to [U].

Additionally, we know from [U12, Theorem 5] that

$$\lambda\Big(\frac{I + P_{SB}}{2}\Big) \le \lambda(P_{SW}) \tag{9}$$

for arbitrary graphs. Combining Theorem 1 with Lemma 16, Lemma 8 and (9) (in
this order) proves Corollary 2.

5.2 Application to the square lattice 

In this subsection we recall the results of [U11] to conclude the proofs of Theorems 3
and 5. For this let $P_{SW}$ be the Swendsen-Wang dynamics for the random-cluster
model on $\mathbb{Z}_L^2$ with parameters p and q. First note that, by [U11, Corollary 2], we
know that there exist constants $c_p$, $c'$ and C such that

$$\lambda(P_{SW})^{-1} \le c_p N \quad \text{for } p < p_c(q)$$

and

$$\lambda(P_{SW})^{-1} \le c' N^C \quad \text{for } q = 2 \text{ and } p = p_c(2),$$

where $N := |V_L| = L^2$. Here, $N \le m \le 2N$ unless L = 1 ($m := |E_L|$).

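Additionally, it is easy to obtain (using the most natural embedding of $\mathbb{Z}_L^2$ in
the sphere) that the dual graph $\mathbb{Z}_L^{2\dagger}$ of $\mathbb{Z}_L^2$ is isomorphic to $\mathbb{Z}_{L-1}^2 \cup_\delta v^*$, where $v^*$
is an arbitrary vertex in the "outer" face of the embedding of $\mathbb{Z}_L^2$ and
$\cup_\delta v^*$ means that there is an edge between $v^*$ and every vertex of the boundary of
$\mathbb{Z}_{L-1}^2$. I.e. $\mathbb{Z}_{L-1}^2 \cup_\delta v^* = (V_L^\dagger, E_L^\dagger)$, where $V_L^\dagger = V_{L-1} \cup v^*$ and $E_L^\dagger = E_{L-1} \cup
\big\{\{v^*, u\} : u = (u_1, u_2) \in V_{L-1},\ \{u_1, u_2\} \cap \{1, L-1\} \neq \emptyset\big\}$.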

Using Theorem 1' from [U11] (and the discussion after it) we obtain that there is
a constant $c_p$ such that

$$\lambda(P^\dagger_{SW})^{-1} \le c_p N \quad \text{for } p^* < p_c(q),$$

where $P^\dagger_{SW}$ is the Swendsen-Wang dynamics for the RC model on $\mathbb{Z}_L^{2\dagger}$ with
parameters $p^*$ and q, which is the dual model to the RC model on $\mathbb{Z}_L^2$ with parameters p
and q whenever $\frac{p^*}{1-p^*} = \frac{q(1-p)}{p}$. Note that this implies $p^* < p_c$ iff $p > p_c$. Hence we
immediately obtain Theorem 3 from Corollary 2 and Theorem 5 from Theorem 1.
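The duality relation between p and $p^*$ used here is easy to explore numerically. The following sketch solves $\frac{p^*}{1-p^*} = \frac{q(1-p)}{p}$ for $p^*$ and evaluates the self-dual point $\frac{\sqrt{q}}{1+\sqrt{q}}$; the function names are assumptions made for this illustration.

```python
import math


def dual_parameter(p, q):
    """Solve p*/(1 - p*) = q (1 - p) / p for the dual parameter p*."""
    odds = q * (1.0 - p) / p
    return odds / (1.0 + odds)


def self_dual_point(q):
    """Fixed point of the duality map, sqrt(q) / (1 + sqrt(q))."""
    return math.sqrt(q) / (1.0 + math.sqrt(q))


# Example: for q = 2 a supercritical p is mapped to a subcritical p*.
p_star = dual_parameter(0.8, 2)
p_sd = self_dual_point(2)
```

The map is an involution and is decreasing in p, so parameters above the self-dual point are sent below it and vice versa, which is the fact used to transfer the high-temperature bound to low temperatures.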

6 Conclusion 

In this article we gave the first proof of rapid mixing of a Markov chain for the 
Ising (resp. Potts) model on the two-dimensional square lattice at all (resp. all 
non-critical) temperatures. Again it should be pointed out that the results of this 
paper (about mixing in $\mathbb{Z}_L^2$) are ultimately due to the numerous results for mixing
of the single-spin dynamics for the Ising/Potts model, as stated in the introduction, 
and are therefore far from being sharp, because the Swendsen-Wang dynamics is 
believed to be (much) faster than single-spin dynamics. Here, we mainly considered 
Markov chains for the closely related random-cluster model. Surprisingly, there is 
an upper bound on the spectral gap of the Swendsen-Wang dynamics in terms of the 
spectral gap of the heat-bath dynamics that only depends on the number of edges 
of the graph, and does not require any structure of the graph. This bound is tight 
up to the logarithmic factor, e.g. for the RC model on trees. 

The proof of this uses some ideas from Markov chain comparison, but the main 
part (of this article) was to find a representation of the Markov chains on a larger 
state space and bounding the norm of the involved (Markov) operators. We guess 
that this technique can be used also in other settings, such as Markov chains in 
general state spaces, to compare Markov chains with small and large "step sizes".

As stated above, the proof is via comparison and it would still be interesting to 
find a direct proof of rapid mixing of the SW dynamics, especially at the critical 
temperature. 